21 April 1961


MEMORANDUM FOR: Deputy Director, Intelligence 


FROM: Chief, DD/I Automation Staff


SUBJECT: Computer Needs


The DD/I Automation Staff has completed its initial survey of
DD/I computer needs. The Staff has investigated numerous "special
project" interests in the Research Offices of the DD/I, has partici-
pated in the development of Project CHURCHWAY, and has sponsored an
intensive six-week survey of automation requirements by teams of
specialists from IBM and the System Development Corporation (SDC).


The FINDINGS of this Staff concerning your EDP needs and our
RECOMMENDATIONS to you appear below. The following Annexes are also
attached: ANNEX A - Summary of IBM and SDC Reports; ANNEX B - The
DD/I Computing Requirement; ANNEX C - The Hardware Requirement; and
ANNEX D - Est Requirements of a DD/I Computer Center.

* * * * *


FINDINGS 


A. Principal Findings 


1. There is a definite computer need in the DD/I Area. This need 
may be categorized into two basic types of computer applications: 


a. "Special project" applications within the Research Offices
involving statistical or other manipulations of data and relating
to specific intelligence problems, and


b. Document and information retrieval applications involving
your major document collections and relating to the entire spectrum
of intelligence interests.


2. The "special project" applications are for the most part 
entirely suitable to present-day electronic data processing (EDP) capa- 
bilities. Data preparation and programming will, of course, be required 
but extensive system analysis and design are not typically involved. 
With the establishment of a DD/I computing capability, implementation 
of these special projects could proceed directly.


3. The document and information retrieval applications will require
an extensive system analysis and design effort. This effort should be
considered experimental and developmental. Computer support should be
available to this effort. The great size of your central reference
files and the broad scope of consumer requirements relative thereto
pose the prospect of equipment and system requirements which cannot
at present be fully satisfied.


4. There is every reason to anticipate a rapid growth in the DD/I
computing need. The "special project" applications will certainly
expand as capabilities are established and demonstrated. With the
design of a successful system for the document and information
retrieval function, a very massive workload will result. There are
three additional factors which will foster a growing reliance upon
computers in the future:


a. The advanced collection programs for ELINT, RADINT, and
COMINT are being designed to acquire information in analog and
digital form. With corresponding equipment development on the
data-analysis side, most of this information can be made available
immediately for computer processing, thereby greatly speeding the
realization of intelligence gain from these sources.


b. The growth of computer capabilities within the Intelligence
Community will lead to an increase in the exchange among community
members of information recorded in machine language formats.


c. The increased use of tape-generating typewriters at Head-
quarters and in the field will make available in machinable form
vast amounts of intelligence data without additional preparation
effort.


5. There is a need for basic research in intelligence applications
for computer capabilities. Such research would serve to keep the CIA
computing effort dynamic and imaginative.


6. The DD/I, and indeed CIA, is very short on the skills required
to implement and operate a computing facility. An extensive training
effort for Agency personnel will be required. In addition, we must
rely for some time to come on outside sources for certain necessary 
skills. 


7. Centralization of EDP Staff functions and of computing facili-
ties will tend to maximize the usefulness of hardware, software
(supporting computer programs and operating procedures), and the
limited human skills available to us.


8. Years of effort, experience, and policy support are required to
achieve an efficient and effective computing capability. Given a vigor-
ous start now, it will still be 1964 or later before the major potential
of a DD/I computing capability will have become manifest.


9. Top Management support is absolutely essential to any effort to
establish a DD/I computing capability. Such an effort will require
money, people, space, etc.; it will also eventually influence opera-
tional and organizational structures. The latter consequence must be
expected and faced squarely from the start.


10. It should be stressed that the successful prosecution of the 
extensive system study of the DD/I document and information retrieval 
function (paragraph 3 above) will require the full Management approval 
of those Offices affected (OCR is the principal such Office). Exper-
ience in this area shows that the best experimental efforts of this
kind occur when the personnel and management of the organization con-
cerned are enthusiastic about the objectives and undertake the major
share of the work themselves. Without this climate, the wisdom of
starting such a laboratory is questionable.


11. The establishment of a computer capability for the DD/I is
going to cost money and people. Such a facility proffers not only a
better way of doing old things, but also a new facility with new
capabilities. Any decision you make to go ahead on the computer front
should be based on your basic need for more sophisticated data process-
ing capabilities and not on the hope of saving money.


B. Background Findings 


1. Computer capabilities are actively being explored and developed
now in every major component of the Intelligence Community. The impact
of computers on the structure and operations of the Intelligence Com-
munity is already being felt and will grow dramatically in the years
ahead.


2. The scope of Community efforts in the computing field today --
and, thereby, some suggestion of the impact for tomorrow -- is indicated
by the following list of computers now in use by the Intelligence Com-
munity:


NSA ------- Four IBM 704's; three IBM 705's; IBM 650;
            UNIVAC 1103; IBM 1401; Harvest on order


Air Force-- Three IBM 7090's; four IBM 1402's; three Minicard
            systems


Army ------ Sylvania 9400; CDC 1604 (pop-pac)


Navy ------ IBM 1401; IBM 7090


CIA ------- IBM 650 (DD/P); RCA 501 (DD/S); ALWAC (NPIC);
            one Minicard system


3. The DD/P has been at work for four years now in designing and
implementing an advanced, large-scale document and information retrieval
system (Project WALNUT). This system is designed to satisfy internal
DD/P needs and will not possess the additional capacity to meet DD/I
requirements. Hardware under development for this Project may have
applications in the DD/I.


4. The RCA 501 computer of the DD/S is now operational. The DD/I
has been and will be given some time on this computer although time avail-
able to the DD/I will be far from adequate to handle the DD/I workload.


5. NPIC has a small, special purpose computer (ALWAC) for its
photogrammetric and other applications. This computer does not consti-
tute a significant EDP capability.


6. The DD/I pioneered the application of machines to intelligence
tasks with the establishment a dozen years ago of its OCR punched
card system. It has not, however, kept pace with other elements of
the Community in second-generation (computer) equipment.

* * * * *


RECOMMENDATIONS


The DD/I Automation Staff recommends that you:


1. Decide now to commit your Area to a major EDP effort and to
establish a DD/I Computing Center in the New Building. 


2. Create a new Operating component, which shall constitute the 
DD/I Computing Center, to support total DD/I requirements for EDP 
services. 

3. Approve the following short-term objectives for the Center: 

a. To apply EDP capabilities to DD/I Special Projects. 


b. To perform EDP experimentation with the document and
information retrieval requirement.


c. To initiate a broad, EDP-oriented, system design study
for optimizing reference capabilities in support of user require-
ments.


4. Approve the following long-range objectives for the Center:


a. To implement and maintain EDP programs devised to support
evolving DD/I functions.


b. To develop and maintain those human skills required for
a dynamic computing program for the DD/I area.


c. To monitor advances in computer technology and to plan
and provide for the future equipment requirements of the DD/I
area as the DD/I computing program develops.


d. To sponsor appropriate R & D programs in the application
of computer capabilities to intelligence processes.


e. To serve as the focal point of representation for the
DD/I in EDP matters.


5. Direct the DD/I Computing Center to:


a. Commence staffing efforts against a first-year personnel
authorization of approximately thirty-five (35) persons.

b. Implement the DD/I EDP program in conformance with the
short-term objectives in paragraph 3 above.


c. Procure the use of outside contractual services to the
extent of approximately twenty-five persons for training, system
design, programming, etc.


d. Issue a letter of intent for the procurement of EDP
equipment on a rental basis.


6. Obtain priority support from the DD/S Data Processing Staff
to assist in the DD/I planning and implementation effort.


SUMMARY OF IBM AND SDC REPORTS


Summarized below are the principal recommendations contained in
the recent IBM and SDC surveys. The cost-free IBM and SDC reports
were prepared to assist us in the formulation of a sound DD/I plan
of action.


1. IBM restricted the scope of its effort to studying OCR
functions. SDC, however, was asked to survey, in addition to OCR,
the possible computer needs ("special projects") of the DD/I research
offices.


2. The IBM PAPER makes suggestions for EDP within BR and SR and
recommends a long-range systems analysis and hardware development
program. Specifically, IBM finds that:


a. OCR punched card systems have reached their optimum
effectiveness and the time is at hand to implement conversion to
second-generation (EDP) systems;


b. A simplified WALNUT document image device offers promise
for certain OCR files;


c. A WALNUT index device (large capacity, random access,
magnetic storage) is applicable to a number of index files in
OCR, especially BR;


d. SR should be used as a "proving ground" for the develop-
ment of an advanced OCR-wide data processing system;


e. A short-term, cost-free study should be conducted jointly
by IBM and CIA on SR and BR to plan the conversion of selected
functions from punched cards to magnetic tape. EDP equipment
should be procured and conversion effected as indicated by this
study;


f. A long-term over-all system analysis should be undertaken
by competent systems engineering personnel to define detailed
requirements for an advanced OCR data processing system.


3. The SDC PAPER stresses comprehensive planning plus top manage-
ment support as indispensable prerequisites to the introduction of
computers within an organization. It says that the need exists now
for an intensive systems study aimed at the design of a new information
processing system for the 1965-1970 time scale. SDC devotes consi-
derable attention to the managerial aspects of getting going, and
concludes that EDP equipment should be ordered for a DD/I computing
facility as soon as possible. Its principal recommendations are that:


a. "Special project" needs in the research offices represent
adequate justification now for the establishment of a DD/I
computer facility.

b. A part of this facility should be established to perform
experimental work on advanced OCR document and information
control systems based, at the start, on the files of SR.


c. CIA staff up for an Agency data processing program, with
outside support, to lay the foundation for an integrated CIA
information processing system by 1965.


d. Short-range improvements to the existing OCR systems
should be minimized.


e. A permanent EDP Planning Staff, reporting to the highest
feasible administrative level, should be established to advise
and assist top management in all areas of information processing
in CIA.


f. A broad training program should be initiated to develop
data processing knowledge and skills in CIA.


g. CIA should monitor and support research and development
efforts in the EDP field on an increased scale.


h. External support should be obtained to assist CIA in
(1) the proposed over-all system study, (2) the development of
an advanced document and information control system in OCR,
(3) the formation of computer solutions to research office
"special project" needs, and (4) the training of CIA personnel.


4. There is considerable contrast between the IBM and SDC
papers in the approaches advocated; namely:


IBM                                   SDC

Suggests a cautious and               Stresses that computer capa-
gradual transition from present       bilities require the structur-
methods to EDP.                       ing of an entirely new system.


Recommends BR and SR convert          Warns repeatedly against
certain present operations to         "patchwork" solutions.
EDP now.


Offers to develop new hardware        Recommends the development
with CIA support.                     of new systems with CIA
                                      support.


Pays tribute to the systems           Suggests that major portion
competence now on hand in OCR.        of skills initially required
                                      by CIA must be obtained from
                                      external sources.


5. In spite of the divergence suggested by the above, these papers
have many fundamental areas of accord between them. For example, both
IBM and SDC conclude that:


EDP is definitely applicable to the functions 
of OCR and the me: 


The DD/I should start now to move in the direc-
tion of establishing a strong EDP capability.


The CIA should decide now to acquire EDP
equipment for the DD/I.


A developmental effort should be launched now
intended to devise a computer-driven document
and information retrieval system suitable for
general application within OCR and perhaps
other areas of the DD/I.


EDP training is of strategic importance to
Agency personnel.


An extensive system design effort should
be undertaken to achieve optimum advantage
of EDP capabilities by the mid-decade
period.


ANNEX B
(Section I)


THE DD/I COMPUTING REQUIREMENT 


GENERAL 


A. The DD/I computing requirement may be categorized into 
two basic types of computer applications: 


1. Special Project applications within the Research
Offices involving statistical or other manipulations of data and
normally relating to specific intelligence problems.


2. Document and Information retrieval applications 
involving major document collections and relating to the entire 
spectrum of intelligence interests. 


B. There is every reason to anticipate a rapid growth in
the DD/I computing need. The Special Project applications will
certainly expand as capabilities are established and demonstrated.
With the design of a successful system for the document and
information retrieval function, a very massive workload will
result.


C. There are additional factors which will foster a grow- 
ing reliance upon computers in the future: 


1. ELINT, RADINT, COMINT, and other collection programs
are approaching a capability for rapid, if not simultaneous,
conversion of analog data to digitalized forms required for
computer processing. Once this capability is achieved, informa-
tion from these collecting programs can be made available for
immediate computer processing, thereby speeding the realization
of intelligence gain from these sources.


2. The growth of computer capabilities within the
Intelligence Community will lead to an increase in the exchange
among community members of information recorded in machine
language formats.


3. The increased use of tape-generating typewriters at
Headquarters and in the field will make available in machinable 
form vast amounts of intelligence data without additional pre- 
paratory effort. 


4. A general up-grading in the level of sophistication
in data handling activities is now occurring throughout the
Intelligence Community and will certainly continue for some time
to come. This results in large measure from the introduction of
computers into the Community's data processing programs. CIA
will desire to and should keep up with -- if not lead -- this
development.


D. At the present juncture, the DD/I has no computing
capability (with the exception of the small-capacity ALWAC
machine in the NPIC). The DD/I is lagging behind virtually
all other elements of the U. S. Intelligence Community in the
application of computing resources to the intelligence task.


ANNEX B
(Section II)


II. THE SPECIAL PROJECTS REQUIREMENT


A. DEFINITION 


1. The term "special projects" as used in this paper refers
to individual file and data processing needs which by nature fall
outside the normal machine support systems of OCR (hence "special")
and which appear suitable for machine handling. "Special projects"
are usually characterized by a relatively circumscribed data base
of extracted information, by updating requirements, by intensive
manipulation of the data base, and by periodic report-generation
(print-out of data by machine). Most special projects are located
in the DD/I research offices.


2. Special projects constitute one of the two basic types of
potential computer applications within the DD/I Area. The DD/I
Automation Staff has recommended the establishment of a computing
capability to handle the special project need (Recommendation 3a).


B. GROWTH IN ANALYST INTEREST 


1. DD/I intelligence analysts frequently encounter file or
data processing requirements beyond their own capabilities and
not accommodated by the central reference system. This is natural
and not necessarily a reflection on the established reference
system. It is to be expected that special data processing require-
ments will always be a part of the DD/I analytical scene... and,
indeed, that the number and nature of such requirements will vary
directly with the degree to which the DD/I intelligence effort is
imaginative, dynamic, and vital.


2. Central to the special project concept is the handling
and recall (in forms usable by the intelligence analyst) of
information. It is the analyst's need for quick access to
information, rather than documents, and, even more important,
his frequent need to manipulate such information in divers and
often sophisticated fashions which has generated the expanding
interest in special project services. There is every reason to
anticipate a continued growth - if not, indeed, an explosion -
in the demand for machine-driven research support of this nature.


3. There are many factors fostering development of the DD/I
interest in special projects. Some of the more basic are:


a. The Agency's document collections have grown to such
sizes that it is often impossible for the research analyst to exploit
the large number of documents provided to him in response to his
requests.


b. The period of exhaustive, basic research within the
DD/I has largely come to an end. The ponderous 200-300 page
study, based on detailed analysis of all available materials, is
no longer being written. The DD/I analyst's reports are now
much briefer, their topics more circumscribed, and their chrono-
logical span more restricted. The DD/I analyst is now endeavor-
ing to update what has previously been published... to report
changes in status. He is increasingly concerned with current
reporting. He is often working against very short deadlines.
He has less need and less time for methodical perusing of docu-
ments.


c. Data analysis techniques employed in the DD/I are
becoming more sophisticated. Statistical analysis, extrapola-
tion and interpolation, model building, etc., are assuming
increased importance to the DD/I analyst in his efforts to pro-
duce better intelligence. Such techniques are often difficult
or impossible to use without machine support. The availability
of a computing capability within the DD/I will give further
impetus to the use of such techniques.


d. The DD/I is now placing heavy emphasis on special
weapons intelligence. The intelligence effort in this field
often requires the sifting of large quantities of relatively
low grade materials in order to extract pertinent fragments of
data. These fragments must often then be sorted, correlated,
or otherwise manipulated in the effort to generate an intelli-
gence picture. At the same time, the analyst is faced with an
increasingly urgent demand for timeliness in his reporting. It
is only natural that, faced with these requirements, the special
weapons analyst has turned toward the machine for support.


e. In several other parts of the Intelligence Community,
computing resources are today being applied to data problems
generically similar to those faced by our DD/I analysts. Our
analysts are, of course, exposed to these computing efforts
through contact with their counterparts in other organizations.
Their exposure to these efforts has done much to spark a desire
within our DD/I Offices to bring similar resources to bear upon
their own problems.


C. GROWTH IN MACHINE CAPABILITIES 


1. While the need for special project support has been grow-
ing, the capabilities of machines to assist in this type of effort
have been dramatically extended. With the advent of the computer,
an entire new dimension in the storage, handling, analysis, and
retrieval of information by machine has resulted. Computers pro-
vide a greatly increased power to store in machinable form large
masses of information, to search out and locate desired information,
to manipulate data in intricate and logically sophisticated fash-
ions, and to print out information in natural language and with
full command over the editorial formats of print-out. All of these
functions are performed, of course, at very high speeds. The
appreciation of the above on the part of the DD/I Offices has
naturally served to stimulate further their interest in the spe-
cial project aspect of the DD/I data processing outlook.


D. TYPES AND EXAMPLES OF SPECIAL PROJECTS


1. The DD/I Automation Staff has identified several potential
special projects in the DD/I Area. There is considerable varia-
tion among these projects in terms of data volumes, nature and
complexity of data manipulation and output requirements, ease of
implementing for computer processing, importance of the intelli-
gence problem to which they relate, prospect of intelligence gain,
etc. These projects do not lend themselves to clear-cut and dis-
tinct categorization. It may be helpful, nonetheless, to indicate
three major types of applications represented with examples of
each:


a. File Maintenance and Report Generation


This type of special project application is charac-
terized by medium-to-large input/output volumes and mini-
mal internal processing requirements. Such projects are
relatively easily planned and activated. Programming
requirements are standard and often satisfied by library
routines supplied by the equipment manufacturer. Large
machine memory is not normally necessary but strong in-
put/output and peripheral equipment capabilities are
required. Preparation of input data for machine processing
is typically a sizeable effort.


Examples:


b. Mathematical and Statistical Analysis


Input/output volumes are, in general, more moderate
and internal processing requirements greater for this
type of application than is the case with file mainten-
ance tasks. Most of the mathematical techniques involved
are well known and established; some are more advanced,
however, such as numerical approximations to dynamic
models. The formulation stage is considerably more com-
plex for this category of applications. Extensive con-
sultation with the sponsoring intelligence analysts will
be necessary and greater analytic capability on the parts
of the formulators and the computer programmers is re-
quired. The programming requirement is typically com-
plex. Large amounts of machine memory may be required.


c. Advanced Systems


Applications falling into this category will tend to
be large-scale computing systems of complex design, using
advanced hardware and advanced techniques. These appli-
cations should be considered in large part developmental.
All aspects of data handling and computing may be involved,
from the most simple to the most challenging. These pro-
jects or systems will be characterized by such aspects of
computing as: pattern recognition and analysis, hypothesis
testing, machine "learning", automatic syntactic analysis
of input data, advanced visual displays, remote inquiry
and man/machine interaction, etc. This type of applica-
tion stresses creative participation in the system by the
intelligence analyst and taxes the automation potential
to the utmost. System design, computing procedure formu-
lation, programming, hardware configuration, etc. will
be expected to progress through several stages of develop-
ment and interplay before maximization of the project is
achieved.


Examples: 


(1) Project CHURCHWAY 
(2) Project SNOWPLOW (CIA Counterpart)
(3) National Indications Project 


E. PROSPECT 


1. The data-processing requirements of special projects are,
for the most part, entirely compatible with present-day EDP
capabilities. Data preparation and programming will, of course,
be required but extensive system analysis and design are not
typically involved. Much of the work of implementing selected
projects can be carried out prior to the installation of computing
hardware. Analysis of input data and project objectives, project
design, preparation of input data for machine processing, writing
of computer programs, etc. can start as soon as manpower becomes
available. De-bugging of programs written during this period can
be carried out on equipment available elsewhere in the Washington
area. Thus an operational capability for special projects should
be a reality directly subsequent to installation of hardware and
engineering checkout.


2. At present, the DD/I is not organized to handle the
special project need of its analysts. To the extent this need
has been met, it has been met by superimposing the special pro-
ject workload upon organizations staffed and equipped for other
activities. The Special Register in OCR has been the most aggres-
sive component within the DD/I in the special projects area and
has done much with its punched-card equipment and limited man-
power resources. But support to this need falls far short of
what is required for a dynamic and advanced special projects
capability for the DD/I. The Special Project need can be ade-
quately provided for only by establishing an organizational ele-
ment charged with meeting this need and equipping that component
with EDP capabilities.


ANNEX B
(Section III)


THE DOCUMENT AND INFORMATION RETRIEVAL REQUIREMENT


Before computer capabilities can be applied to the DD/I
document and information retrieval function, a comprehensive
system design effort must be conducted. The objective of this
effort will be the structuring of an advanced, EDP-oriented,
document and information retrieval system which will satisfy
the real reference needs of the DD/I for the 1960 decade. This
system design project should be computer-supported. (Recommenda-
tions 3b and 3c)


* * *


A. BACKGROUND


1. The system design undertaking recommended by the DD/I
Automation Staff will be DD/I-wide in scope. The organizational
component most directly concerned, however, will be OCR... where
the document and information retrieval function is largely cen-
tralized.


2. The OCR reference system is unique in the scope and com-
plexity of its task. From the earliest days OCR management has
sought to exploit the potential of machines to assist in this
gigantic document and information control effort. OCR has, over
the years, developed machine indexing skills in the area of its
responsibility which are unequalled elsewhere in the Community.


3. OCR is now operating a series of very large-volume,
punched-card index systems. Each of these systems has its
strengths and each has its problem areas. It is easy and per-
haps natural for the user of OCR services to find fault with the
reference responses received from these systems. Unfortunately,
however, many users' criticisms are parochial in nature and do
not pave the way to real and substantial improvements in the OCR
system.


B. PROBLEM AREAS


1. The above is not to say that real difficulties do not
exist in the OCR system, or that our reference services cannot
be improved. There are several problem areas of concern to OCR
management as well as user components. Many of these problem
areas have been examined and commented on by past survey groups
and it would be redundant here to dwell on them. DD/I officials
and analysts are well aware of such problems as: multiple serv-
ice points within OCR; imprecise document retrieval; "wheel-
barrow" volume of system responses; incomplete index coverage;
ad hoc processing techniques which compromise system integrity;
duplicative input processing among OCR components and, more
serious, redundancy in documents furnished by these multiple serv-
ice points to a given consumer on a given problem; inadequate
capability for information (vs. document) retrieval; inability
of the system to respond effectively to certain new or special-
ized recovery requirements; limitations on the length of the
machine record and therefore the amount and form of data recorded;
coded information output instead of natural language output;
space requirements, etc.


2. Some of the above problems have reached such serious pro-
portions as to endanger certain of the fundamental objectives of
the OCR service system. Efforts to attack these problems have
been made largely within the context and constraints of an OCR
essentially unchanged since its original establishment. Al-
though punched-card speeds and capabilities have increased over
the years and OCR has made certain system improvements - some
of considerable significance -, the fact remains that the over-
all reference effort is operating today in terms of the basic
organizational, schematic, and equipment characteristics of a
decade ago.


C. PROGRAM 


1. It is time for a serious, intensive, and comprehensive
study of the DD/I document and information retrieval function
and for a major effort to design a new and advanced document
and information retrieval system. It is time because:


a. The OCR reference system has not changed basically
since its inception.


b. Real problems exist today. 


c. The solutions to these problems do not appear to
lie within the tolerances of the present OCR system.
OCR has worked diligently to maximize the effective-
ness of its operations and it therefore seems un-
likely that new and major extensions of OCR's capa-
bilities can be realized within our present sys-
tematic, organizational, and hardware posture.


d. Second-generation equipments (computers) are now
available which offer dramatic new data processing 
capabilities. The new dimensions in hardware capa- 
bilities argue for a design of new dimensions in 
system capabilities. 


e. Advances with computer applications elsewhere in
the Intelligence Community are such that the central 
position of OCR in its field of responsibility may, 
indeed, soon be placed in jeopardy without such a 
study and design effort. 


2. Let it be squarely faced, however, that EDP offers no
guarantee of success. EDP is expensive. It is complex. Its
application in the document and information retrieval field is
still largely uncharted. EDP will place new and exacting demands
on the skill and ingenuity of our personnel and on the struc-
ture and integrity of the data processing systems we evolve.
Experience has demonstrated, however, that machines are essen-
tial to the sophisticated control of large-volume document and
information collections. An extensive system design effort,
embodying an upgrading of machine support from first-generation
(punched card) to second-generation (computer) capabilities...
and building on hardware capabilities which are today only
anticipated... does appear to constitute the logical next step
in the evolution of our DD/I reference system and, as such,
offers the best prospect of success.


3. The DD/I Automation Staff has recommended a broad, EDP-
oriented, system design effort (Recommendation 3c) and an EDP
experimentation program (Recommendation 3b) for the document
and information retrieval function to be conducted by CIA and
contractor personnel.


a. The basic objectives of the system design effort are:


(1) To anticipate and define the proper, consumer- 
oriented, goals of our reference service for 
the 1960 decade. 


(2) To devise an advanced, computer-driven system
to fulfill those goals.


(3) To evolve methods and techniques for perform-
ing, on an integrated basis, the constituent
functions of the system.


(4) To determine the informational, processing,
personnel, and equipment requirements of the
system.


(5) To construct an over-all implementation plan
for the system, including, where applicable,
conversion, parallel operation, and phase-out
of present systems.


b. The basic objectives of the experimentation program are:


(1) To provide a laboratory for operational devel-
opment and computer testing of concepts and techniques
surfaced by the system design effort.


Certain of these experimental areas can
be anticipated; e.g., new index languages
based on normal English words, to facili-
tate data input and permit direct output
from the machine in natural language form;
table look-up applications to permit auto-
matic conversion from natural language to
digital code and vice versa; automatic
indexing to permit the indexing by machine
of information available in machinable
form; computer testing of diverse schemes
of file organization; automatic dissemina-
tion; and, perhaps, automatic abstracting.


(2) To plan and carry out an experimental conversion 
operation for the Special Register system. 


This conversion operation will be based 
on present SR input indexing. Input data 
will be restructured, however, to take 
advantage of EDP record lengths and search 
capabilities. Both serial sequence (tape 
reels) and direct or random access index 
storage techniques will be simulated and 
evaluated. 


(3) To plan and test on the computer an experimental 
model of an all-source, all-topic, document and infor- 
mation retrieval system. 


This model will serve to test the feasibility
and capabilities of a single, integrated,
OCR-wide document and information retrieval
system.


(4) To plan and test models of systems structured
by topic rather than source document (e.g., a bio-
graphic information system), and other such models as
are appropriate.


(5) To experiment with methods of computerizing
selected sub-systems within the OCR Registers as
seems profitable.


(6) To foster the development within the DD/I of 
computer-related human skills in the document and 
information retrieval field. 
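
One of the experimental areas named under objective (1), table look-up conversion between natural language and digital code, can be pictured with a short sketch in a present-day language (Python). This is an illustration only, not a description of any OCR procedure: the terms, the four-digit codes, and the names TERM_TO_CODE, CODE_TO_TERM, encode, and decode are all assumptions introduced for the example.

```python
# Hypothetical code table: these terms and digital codes are invented for
# illustration and do not come from any actual OCR coding scheme.
TERM_TO_CODE = {
    "petroleum": "4410",
    "railroads": "5120",
    "steel": "4230",
}

# Build the reverse table once so that decoding is also a direct look-up.
CODE_TO_TERM = {code: term for term, code in TERM_TO_CODE.items()}


def encode(terms):
    """Convert natural-language index terms to digital codes."""
    return [TERM_TO_CODE[term.lower()] for term in terms]


def decode(codes):
    """Convert digital codes back to natural-language terms."""
    return [CODE_TO_TERM[code] for code in codes]


if __name__ == "__main__":
    codes = encode(["Steel", "petroleum"])
    print(codes)
    print(decode(codes))
```

With tables of this kind the indexer could work in English words on input and the analyst could read English words on output, while the machine record itself stays in compact coded form.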


4. An important segment of both the system design and experi-
mentation programs will be the structuring and testing of a single,
all-topic system for document and information retrieval which would
provide for:
a. A single reference point for OCR, regardless of topic
on which reference support is desired.
The customer would place one integrated request
which would activate one integrated servicing
effort which would, in turn, produce one inte-
grated service product at one point in time.
The desirability of such a reference support
structure has repeatedly been expressed by OCR
consumers.


b. The elimination or reduction of multiple input proces-
sing by providing for a single input effort to control
all elements of a given document.
The multiple input processing which sometimes
occurs within the present OCR system consumes
manpower and thereby contributes to the pro-
cessing backlogs and ad hoc filing and control
arrangements which have taken on such serious
proportions.


c. An improved "pertinency ratio" for documents or
information recovered from the system.
An all-topic control within a single, homo-
geneous system, preserving valid data relation-
ships and providing for flexible recovery by
logical combinations of data, would enhance
both the subtlety and the specificity of docu-
ment and information recovery.


d. The expansion of information (as distinguished from
document) retrieval capabilities of the OCR machine system.


An increased information retrieval capability
would speed the research analyst more directly
to the data he needs and would reduce the docu-
ment retrieval requirement. This is a particu-
larly fertile area for development.
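
The "recovery by logical combinations of data" and the "pertinency ratio" mentioned in item c can be made concrete with a small sketch in a present-day language (Python). It rests on stated assumptions: the pertinency ratio is taken here to mean the fraction of recovered documents that are actually pertinent to the request, a definition this paper does not itself give; and the document numbers, index terms, and function names are hypothetical.

```python
from collections import defaultdict


def build_index(documents):
    """Map each index term to the set of documents carrying that term."""
    index = defaultdict(set)
    for doc_id, terms in documents.items():
        for term in terms:
            index[term].add(doc_id)
    return index


def search(index, all_of=(), any_of=(), none_of=()):
    """Recover documents by a logical combination of index terms."""
    # Start from every indexed document, then narrow the set.
    result = set().union(*index.values())
    for term in all_of:  # logical AND
        result &= index.get(term, set())
    if any_of:  # logical OR across the listed terms
        result &= set().union(*(index.get(t, set()) for t in any_of))
    for term in none_of:  # logical NOT
        result -= index.get(term, set())
    return result


def pertinency_ratio(recovered, pertinent):
    """Assumed definition: share of recovered documents that are pertinent."""
    if not recovered:
        return 0.0
    return len(recovered & pertinent) / len(recovered)


# Hypothetical holdings, invented for illustration only.
DOCUMENTS = {
    "D1": {"steel", "production", "1960"},
    "D2": {"steel", "exports"},
    "D3": {"petroleum", "production"},
    "D4": {"steel", "production", "exports"},
}

if __name__ == "__main__":
    index = build_index(DOCUMENTS)
    broad = search(index, all_of=["steel", "production"])
    narrow = search(index, all_of=["steel", "production"], none_of=["exports"])
    print(sorted(broad), pertinency_ratio(broad, {"D1"}))
    print(sorted(narrow), pertinency_ratio(narrow, {"D1"}))
```

In this toy case the more specific request recovers fewer documents, and a larger share of them is pertinent, which is the kind of improvement item c has in view.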
ANNEX C


LETTER OF INTENT


The DD/I Automation Staff has recommended that a letter of
intent be issued as soon as possible for the delivery in approxi-
mately twelve months of EDP equipment on a rental basis for the
DD/I Computing Center (Recommendation 94).


GENERAL CONSIDERATIONS IN SELECTING EQUIPMENT


The following general considerations bear upon the selection 
of computing equipment: 


A. The equipment should be well-proven to insure 
reliability. 


This will promote maximum utilization of
available machine time and minimize re-
runs due to machine malfunction.


B. The equipment should be modular in design. 


"Modular" refers to the building-block 
approach in the design of equipment.
Such design contributes to ease and speed of
maintenance and permits alteration of system
capabilities.


C. Equipment which is widely accepted and used should 
be selected. 


Such equipment will carry with it a strong 
"software" capability. ("Software" is the 
supporting programming and system documenta- 
tion necessary for efficient utilization of 
any EDP complex. It includes manuals, service 
routines, high-level programming languages, 
and a wide variety of operating procedures. 
The ready availability of required software 
can expedite and greatly reduce the cost of
operations.)


D. The equipment should be able to accept as inputs
the machine language products from other data processing computer 
systems in the Intelligence Community. 


This means either the equipment should be 
of the same family as the majority of other 
systems or that it must have the capability
to convert products and programs from these 
other systems into a form acceptable to the 
equipment. 


E. Equipments acquired should be from the same family
(generally this means the same manufacturer).


This assures functional compatibility within 
the computer complex - an important consi- 
deration from both the hardware and software 
points of view. 


F. The family of equipment should include a wide 
variety of modules or elements, both in type and capability. 


This increases flexibility in facilitating
interchange and regrouping to accommodate
changes in processing or organizational
requirements.


G. The chosen manufacturer should have extensive
research capabilities for continued development of its product
line.


This allows the user to plan for more advanced 
capabilities in the future with some confidence 
that new equipment developments will proceed 
apace. 


DD/I CONSIDERATIONS 


A. Computing applications presently identified within 
the DD/I fall into two basic types: special project applications 
and document and information retrieval applications. Most medium
to large scale computer systems could handle both types of
requirements. Certain of the special projects to be activated,
however, are better suited to implementation on a scientific
computer. The document and information retrieval applications
and certain other special projects are better suited to imple-
mentation on a data processing computer.


The term "scientific" computer is generally
used to denote a computer system having 
extremely powerful arithmetic capabilities 
built into its design - such as fixed word 
length memory and parallel transfer. A 
system of such design is better suited to 
performing mathematical computations than 
to manipulating large volumes and varieties 
of alpha/numeric and natural language data. 
The "data processing" computer, on the other
hand, is engineered to handle well these
varied alpha/numeric and natural language
data but is less suited to performing
mathematical computations.


B. The above suggests acquisition of two computers
for the DD/I Computer Center: a scientific computer and a data 
processing computer. 


C. It is the opinion of the DD/I Automation Staff, 
however, that the initial letter of intent should provide for 
one computer only. It is also the Staff's opinion that, in 
view of the over-all nature of the DD/I requirement, the computer 
ordered now should be in the data processing class rather than 
a scientific computer. 


D. In addition to the general selection considera-
tions in paragraph II above, the following specifics should be
looked for in selecting the initial computer system for the DD/I:

1. Flexible and powerful input/output capabilities
(accepting punched cards, paper tape, and
magnetic tape, and having high speed
printer)

2. Flexible and efficient data manipulation capa-
bilities within the main frame
(variable word length and character access)

3. Large random access storage capacity

4. Numerous (15/4) magnetic tape drives

5. High speed, high density magnetic tape
characteristics

6. Medium size core memory

7. Extensive buffering

8. Extensive instruction repertoire

9. Adequate complement of registers

10. Parallel processing/time sharing features

11. Good interrupt features

12. Medium arithmetic capability

13. Effective console control

14. Typewriter inquiry station

15. On line/off line switching capability


E. The Staff's conclusion that only one machine should
be ordered at this time is based on the judgment that additions
to the hardware complex of the DD/I Computer Center should be
considered after we have gained more experience in the DD/I
effort and after the direction of the proposed system design
effort on the DD/I document and information retrieval function
has become manifest. The expected availability of time on the
DD/S RCA 501 computer for DD/I applications has constituted a
significant factor in reaching this conclusion.


F. With the expansion of the DD/I special project 
effort, the prospective need for a scientific computer must be 
recognized. The development of a computer-driven system for
the DD/I document and information retrieval function will clearly
generate major new hardware requirements. CIA management should
be aware of this outlook.


ITEM #2: Contractor Personnel


A contract should be let with a suitable outside organiza-
tion to provide needed skills not available within the Agency.
The main aim of the contract will be the redesign and structuring
of an EDP-oriented information processing system for the DD/I.
Approximately twenty-five (25) people should be so engaged at the
start. The requirement of an additional ten to fifteen people
is anticipated for the second year. Skills provided should in-
clude: training specialists, system analysts, programmers, and
EDP managerial specialists. This group would be expected to reach
its greatest strength during the second year. The contractor
group should taper off to the zero point by the end of the fourth
year. This group will work closely with CIA personnel and in
support of CIA personnel.


ITEM #3: Equipment Rental


A letter of intent should be placed as soon as possible for
the rental of suitable EDP equipment. Installation of this equip-
ment will take place during FY 63; therefore, no funds for this
purpose will be required during FY 62.


Punched card and punched paper tape generating equipment 
will be required from the start to support the preparation of 
input data and computer programs for special projects and experi- 
mental activities on the document and information retrieval effort. 
Two card punch machines and two card verifier machines should 
be rented; it will probably prove advantageous, however, to pur- 
chase the paper tape generating equipment. 


ITEM #4: Supplies and Equipment


Supplies - Supplies to support the computer center will
include: paper tapes, magnetic tapes, punch cards, continuous
form printer paper, operational control forms, etc.


Equipment - Two paper tape punch-verifier machines should
be purchased to support the Special Projects and Document and
Information Retrieval programs. A variety of housekeeping items
(card and tape file cabinets, special work tables and trucks,
paper separating machines, etc.) will be needed.


ITEM #5: Programming Support


Programming capabilities will be developed within the CIA
staff. The contractor organization will supply programming
skills, as well. It is anticipated, however, that programming
requirements will occur, particularly during the early years
of this EDP effort, which will best be met under purchase order
or service contract with an outside organization.


ITEM #6: Outside Training


Two basic types of training will be needed: Technical Train-
ing and Personnel Orientation Training. Most of the training
needed can be given in the Washington area by CIA personnel, con-
tractor personnel, and/or hardware manufacturer(s).


Some participation, however, in training programs offered by
other government agencies and academic and professional organiza-
tions will be required. ITEM #6 is to cover the costs of such
training.


ITEM #7: Travel


Included in this item are travel expenses to visit contractor
organizations, equipment manufacturers, out-of-town systems of
interest to the DD/I, conventions, etc.


ITEM #8: R & D Effort


An R & D effort in areas of interest to intelligence should
be sponsored by the Agency. This effort should concern techniques
and systems primarily, rather than hardware, and should complement
and support developmental activities undertaken within the DD/I
in establishing its computing program.


ITEM #9: Site Preparation


Provision for requisite electric power, air conditioning,
and floor loads will constitute a modest site preparation effort
in the New Building. Specific requirements will be provided by
the hardware manufacturer after firm selection of equipment has
been made.


ITEM #10: Space


The space requirements for the Computer Center will approximate
the following:


Computers                                   3,000 sq. ft.
Office Space for Management, Pro-
gramming and System Personnel               4,000 sq. ft.
Punch Room                                  1,000 sq. ft.
Magnetic Tape Storage                         200 sq. ft.
Paper Tape and Punch Card Storage             500 sq. ft.
Engineering Maintenance                       300 sq. ft.
Air Conditioning                            1,000 sq. ft.
Total                                      10,000 sq. ft.